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GAUSSIAN LIMITS FOR GENERALIZED SPACINGS 

By Yu. Baryshnikov, Mathew D. Penrose and J. E. Yukich-*^ 
Bell Laboratories, University of Bath and Lehigh University 

Nearest neie hbor cells in R'^,deN, are used to define coefficients 
of divergence (</)- divergences) between continuous multivariate sam- 
ples. For large sample sizes, such distances are shown to be asymp- 
totically normal with a variance depending on the underlying point 
density. In d=l, this extends classical central limit theory for sum 
functions of spacings. The general results yield central limit theorems 
for logarithmic fc-spacings, information gain, log-likelihood ratios and 
the number of pairs of sample points within a fixed distance of each 
other. 

1. Introduction. Suppose ^(j), 1 < < ?^, are the order statistics drawn 
from an i.i.d. sample with distribution F on M and let G be a distribution 
function. Classical spacing functionals on M (Section 6 of [35]) take the form 
of an empirical cp-divergence 

n-l 

(1-1) ^</.(n[G(X(,,+i))-G(X(,))]), 

i=l 

where (p '■ ^ M is a specified function and where typically F is unknown. 
When F and G have densities / and g, respectively, the functionals (1.1) 
represent an empirical version of the (j)-divergence of g from /, namely 
/ f{x)(l){j^)dx. The (/(-divergence functional, introduced by Ali and Sil- 
vey [1, 2, 3] and independently by Csiszar [9, 10, 11] is a measure of the 
discrepancy of G relative to F. Empirical (/>-divergences are widely used 
in nonparametric estimation and are well suited for goodness-of-fit tests 
[7, 8, 15, 22, 34, 36, 42]. 

This paper has two main goals. The first is to use kth nearest neighbor 
cells to establish high-dimensional analogs of the (/)-divergences (1.1). The 
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nearest neighbor cells are employed to define the statistical discrepancy of 
a proposed distribution with density g relative to an observed i.i.d. sample 
drawn from a distribution with density /. We establish a general central limit 
theorem (CLT) showing that the resulting distance functionals converge to 
a normal random variable whenever / and g are bounded away from zero 
and infinity. The limiting variance is given in terms of the V^^fc-divergence 
and A^^fc-divergence of g from /, where V^^k and A^^^ are certain integral 
transforms of (j). 

Our second goal is to use ^-divergences based on kth nearest neighbors 
cells to provide a unifying approach toward proving classical central limit 
theorems for sum functions of /c-spacings [7, 8, 12, 13, 18, 40, 42]. This yields 
asymptotic normality for information gain, log-likelihood ratios and sums of 
logarithmic spacings whenever the densities of F and G are bounded away 
from zero and infinity. The methods extend to yield a central limit theorem 
for the number of pairs of sample points within a fixed distance. 

More generally, we consider the natural random measures associated with 
the empirical i;^)-divergences, obtained for d = 1 by putting a point mass at 
each of size equal to the ith term in (1.1), and analogously for d > 1. 
We show that these point measures, when acting on bounded test functions, 
and when suitably centered and scaled, converge weakly to a Gaussian field. 

Our approach uses stabilization methods, a tool [5, 31, 32, 33] for estab- 
lishing general limit theorems for sums of weakly dependent terms in geo- 
metric probability. These methods quantify local dependence in ways useful 
for establishing thermodynamic and Gaussian limits and they also show that 
locally defined functionals of Poisson points on large bounded sets can be 
well approximated by globally defined functionals of homogeneous Poisson 
points on all of W^. This latter feature conveniently often leads to explicit 
thermodynamic and variance asymptotics. 

Existing general limit results cannot be applied directly to the high- 
dimensional analogs of (1.1). However, it turns out that the empirical (j)- 
divergences nonetheless involve sums of stabilizing functionals, and one might 
thus expect that the underlying ideas and methods at the heart of stabiliza- 
tion are applicable and lead to variance asymptotics and Gaussian limits for 
the high-dimensional analogs of (1.1). This paper shows that this is indeed 
the case. Further, by adapting the methods of [32] to the present setting, we 
may prove variance asymptotics and central limit theorems over point sets 
with a fixed (non-Poisson) number of points. 

Our general results are stated in Section 2; applications associated with 
particular choices of (j) are discussed in Section 3. Because of their generality, 
our main results have lengthy proofs, which we provide (in Section 4) with 
some details omitted; for full details, see the extended version of this article 
[4]. 
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2. Main results. 

2.1. Preliminaries. 

Notation. We use the following notation throughout. If is a Borel 
subset of M'^, then \B\ denotes its Lebesgue measure. Given X C M"^, a > 
and y G M'^, let y + aX := {y + ax : x £ X}. For x £ M*^, let \x\ be its 
Euclidean modulus and for r > 0, let Br{x) denote the open Euclidean ball 
{y £R'^:\y - x\ < r}. Let denote the origin of R'^, and let uJd ■= |-Bi(0)| = 
n'^/'^ /T{{d/2) + 1). We use logx to denote the natural logarithm of x. 

We let / and g denote two probability density functions on {d G N) 
with common compact support, which we assume is convex and which is 
denoted by A. We assume once and for all that f and g are hounded and 
that they are hounded away from zero on A. Abusing notation, we let F{-) 
[resp. G{-)] denote the probability measure on R'^ with density / (resp. g), 
that is, F{B) := f^f{x)dx and G{B) := jQg{x)dx. 

Throughout Xi,X2,... denotes a sequence of independent random d- 
vectors with common density /. Let Xn := {-'^i, • • • j^n}- Also, given A > 0, 
let V\ be a Poisson point process in A with intensity function Xf -.A^ M+. 
For all a > 0, let Tia denote a homogeneous Poisson point process on 
with intensity a. We write 7i for Tii. 

Given a Borel subset E <ZW^, let B{E) denote the class of bounded Borel- 
measurable real-valued functions on E. Given h G S(M'^), we write ||/i||oo for 
suPa;giRd(|/i(x)|) and given also ^ a Borel measure on R'^, we let denote 
the integral of h with respect to 

We shall consider (/)-divergences and related quantities for a general class 
J- of functions cj), which we now describe. Let := (0,oo). Given a contin- 
uous function ^iM"*" — > M, define the function 0* :]R+ — > [0,oo) by 

,*/,N._/sup{|(^(u)|:t<u<l}, ifO<t<l, 
^ > ^ ^ ' -~ \ sup{|(^(s)| : 1 < s < t}, if t > 1. 

In other words, (f)* is the minimal function on with the properties that 
(i) —(/>*(•) is unimodal with a maximum at 1, and (ii) dominates |(^(-)| 
pointwise. 

Let J- be the class of continuous functions : M"*" M such that the re- 
striction to (0, 1) of the function (f)* defined by (2.1) is square-integrable on 
(0, 1), and such that log(max((/)(t), 1)) = o{t) as t — > oo. Let be the class 
of functions in T which are bounded on (0, 1]. 

Let Fi denote a gamma(l, 1) random variable, that is, let Fi be exponen- 
tially distributed with mean one. Letting Fi^j, i > 1, be independent copies 
of Fi, we put Ffc := Y^^=i'^i,i) ^ gamma random variable with parameters 
k and 1. For o"^ > 0, let N{0,a'^) denote a normal random variable with 
mean zero and variance o"^ . Given random variables X, Y we write X 
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(or Yy X) iiY dominates X stochastically, that is, if P[X <t]> P[Y < t] 
for all t G M. 

2.2. High- dimensional (f)- divergence based on k-nearest neighbor cells. Let 
K, be an open convex cone in (a cone is a set that is invariant under di- 
lations). For all r > 0, let B^{x) := x + (/C H Br{0)). Recall that the aspect 
ratio of a subset E of M*^ is the ratio of the radius of the smallest ball con- 
taining E and the radius of the largest ball contained in E. For d>2, we 
assume that IC is "regular" with respect to A, that is, /C is chosen such that 
the sets B^{x) n A have bounded aspect ratio uniformly over x £ A,r > Q. 
When /C = M'^, this condition is trivially satisfied. If A is the unit cube, then 
/C may be either a tilted orthant or a right circular cone not tangent to any 
coordinate subspace. 

Given the cone /C, x G M'', a finite set X cW^, and € N, put 

(2.2) Ck{x,X):=C'^{x,X):= (J B^{x). 

t>0: ca.rd{Bf- (x)r\X\{x})<k 

Here, card(3^) denotes the cardinality of the finite set y. If card((x + /C) n 
X \ {x}) > k, then C^{x,X) is the largest set of the form Bf'{x) containing 
fewer than k points oi X \ {x}\ otherwise, C'^{x,X) is the whole "wedge" 
X + /C . When /C = R"^ , (x , ;f ) is a ball whose radius is the distance between 
X and its kth nearest neighbor in X\x. 

For each n > 2 and Xj, 1 < i < n, we use the directed nearest neighbor 
cells C^{Xi,Xn) to define high-dimensional spacing functionals analogous 
to the classical one-dimensional functionals (1.1). Define for 1 <i <n the 
transformed feth nearest neighbor spacings 

DU,k--=G{C'^{X,,X^)). 

Given cp £ J^, define the random point measure v^^k-i with total measure 
^n,0,fc' as follows: 

n n 

i=l i=l 

Here, 5x denotes the unit point mass at x. Let T^n (f> k ■~ '^n (p k ~ ^i'^n (p k\ 
the centered version of the measure Vn4>k- 

Henceforth, we call ^ ^ the "/c-nearest neighbors spacing statistic," or 
"empirical nearest neighbor (^-divergence"; it provides a high-dimensional 
analog of the statistic (1.1). Our main concern is with the limit theory of 
K,4>,k and iV^^^^fe. 

The statistic A'^^ ^ ^ provides an empirical measure of the discrepancy of 
the proposed distribution G from the (typically unknown) true distribution 
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F. For example, if /c = 1, then equating Z)/^ ^ with its approximate expected 
value of 1/n yields the approximation N^^ ^ i ~ Z^i 4>{Di n,i/D{.^ i), and thus 
1 provides a naive empirical estimate for the so-called (p- divergence 
[1, 2, 3, 9, 10, 11] of g from / which is defined by 

(2.4) l4>i9J)--= jj{j^^f{x)dx. 

In general, I^{g,f) is possibly negative, and I ,f,{g, f) =I^*{f,g) where (j)*{x) := 
x(f){x~'^). Also, 

(2.5) /0(/,/) = (/.(l); 

(2.6) I^{g, /) > I^{f, f) if (j) is convex. 

Choices oi (p ^ T figuring prominently in estimation and decision theory 
include: 

• (/'o(a^) := — log j; defines Kullback-Leibler information (also called the mod- 
ified log-likelihood ratio statistic or relative entropy) and is used in max- 
imum spacing methods, 

• (t>i/2{x) := 2(1 — y/x)"^ yields the square of the Hellinger distance, 

• (l)i{x) :=xlog3; yields the log-likelihood ratio statistic or I-divergence of 
Kullback-Leibler , 

• (t)2{x) := {x — 1)^/2 yields the chi-squared divergence, and 

• (l)^'^'{x) := x"^ yields information gain of order r, r > 0. 

The ^-divergences N^^f, and I(i,{g,f) ("coefficients of divergence") are 
used heavily in goodness-of-fit tests [36] and are useful in characterizing 
the amount of information of one distribution contained in another [36, 37]. 
Nearest neighbor cells have been used in goodness-of-fit tests in multidi- 
mensions in [6, 38, 43], among others. Note that (2.6) shows I^Q{g,f) and 
I<pi{9:f) are nonnegative, and that I^{i/2){g,f) is symmetric in / and g. 

The following integral transforms of (p (defined for /3 > 0) arise naturally 
in the asymptotic analysis of f ^ ^ ^ (the random variables were defined 
in Section 2.1): 

(2.7) M^^kiP) ■.=mw^k)], 

(2.8) A0,fc(/3) := {k + l)M0,fe(/3) - kM^^k+m, 

(2.9) + / [E[mCk{o,nuy)\)mCk{y,nuo)\)] 

-M^,k{Pf]dy. 

Note that M^^i{x) = {l/x)(p{l/x), where cp denotes the Laplace transform 
of (p. 
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2.3. A general CLT for (f)- divergences. The following general central limit 
theorem, our main result, establishes convergence of n~^^'^{h,T'^ ^ j^) to a 
mean zero normal random variable whose variance is a weighted average 
of the functions V^^fc and A^^^. For h e 13{A), we define the h-weighted (/>- 
divergence of g from / by 

U{g,f,h) := jj{x)<p{^j^^h{x)dx, 

which in the case h=l reduces to the (/)-divergence I^{f,g) defined at (2.4). 
Also, for /i, /ii,/i2 in 13(A) and (p£j^, we define the functions h? ,hih2,4>'^ 
pointwise, that is, h?{x) = and so on. 

In the theorem below, since the formula (2.10) is rather concise, we expand 
it in (2.11). We prove the theorem in Section 4, referring to [4] for some of 
the details. 

Theorem 2.1. Suppose that either cl)e J^Q,d=l, or }C = R'^. As oo, 
it is the case that for h £ B{A), 

Var [(/i, z.^^^ ,)] ^ /y,,, [g, f,h')- (Ia,,, [g, /, h)f 
= j^h\x)V^,,{j^^f{x)dx 

and 

(2.12) n~y^{h,vl^^,) ^ iV(0, (g, /, h') - (/a,,, {g, f, h)f). 

Putting /i = 1 in Theorem 2.1 yields a CLT for the empirical (/)-divergence 

^~'^'iK^,k-^K<t>,k)^miv,j9,f) - (/a,,, (5, /))'). 

For practical purposes, it is of use to compute F^^fc, and the next two results 
show how to simplify the expression (2.9) in some special cases. Using these 
simplifications, we may explicitly identify V^^k for certain choices of 0, as 
shown in Section 3. 

The first of our simplifications applies when /C 7^ M^, and either = 1 or 
d = 1. The latter case is particularly relevant to the study of spacings (see 
Section 2.4). 



(2.10) 
(2.11) 
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Proposition 2.1. If JC ^ W^', and either d= 1 or k = 1, then for all 
P > 0, we have 

(2.13) 

fc-i 

+ 2 ^ Cov[<l){(3T,),<Pif3(T,+, - r,))], 
i=i 

the sum being interpreted as zero for k = l. 

Our second simplifying formula for ^ is applicable when k = 1, KL = 
W^, and (j) is differentiable with limjjo ^Pit) = 0. This will provide limiting 
distributions for some cases of interest, including information gain and log- 
likelihood in high dimensions (see Section 3.2). For s,i,ii£M+, let I{s,t,u) 
be the volume of the intersection of two balls in W^, with respective volumes 
s and t, at a distance u apart. Set 

(2.14) Jdis,t) := / [ei{s,t,{u/u^,) I ) _ x\du. 

J max(s,i) 

Proposition 2.2. Suppose that /C = M'^ and that 4) £ T is differentiable 
with lim^j^o 0(0 = 0- Then for all (3 > 0, 

F0,i(/3)=M.2i(/3)+/32 / / ct>'{()s)(t>'me-'^'^'\jd{s,t)-m?a^{s,t)]dsdt 

Jo Jo 

provided that the integral exists. 

Remarks, (i) {Related work) Bickel and Breiman [6], and subsequently 
Schilling [38], consider the functionals N^^-^ when (/)(a;) = exp(— x) and 

/C = M'^. Using the approximation -Of „ ^ g{Xi)\Ci{Xi,Xn)\, they estab- 
lish a CLT for the empirical process of nearest neighbor distances, but do 
not consider convergence of the associated random measures. Zhou and Ja- 
malamadaka [43] establish the central limit theory for the functionals A'^^ ^ ^ 
for certain (j) of bounded variation for the case g = f as well as for the 
case involving a sequence of appropriately converging alternatives. Strong 
limit theorems for multivariate spacings using general "shapes" are given by 
Deheuvels et al. [14]. 

(ii) {Finite- dimensional CLT) By standard arguments based on the Cramer- 
Wold device, it is straightforward to deduce from Theorem 2.1 the conver- 
gence of the finite-dimensional distributions of n~^/'^T'^^ ^ ^ as n ^ oo [i.e., 

the convergence of the m-vector 'n~^^'^{{hi,T'^ ^ f^) , . . . , {hm,T^n rj} k)) 

hi, ... , hm in ;B(^)] to those of a mean zero finitely additive Gaussian field 

with covariance kernel 

(2.15) (/ii, /12) ^ Iv^^k ia, f, hih2) - lA^ fe (g, f, /ii)/a^,, (g, f, /i2). 
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(iii) [Poisson CLT) For A > 0, A; G N, the Poisson analog of measure f ^ ^ ^ 

is 

(2.16) ^il^y.= H^G{C,{x,Vx)))Sa,, 

xePx 

and its total measure is a Poissonized version of ^ ^. Our approach yields 
a proof (see Proposition 4.1 below) that (j) & J- and h € B{A), then as 
A oo, 

(2-17) X-'Y^v[{h,i,i^^,)]^ Iv,Jg,f,h^) 

and A"^/^/^^ ^ ^ converges in law to a mean zero Gaussian field with covari- 

ance kernel ihi,h2) ^ ly^j^ig, f,hih2) (here Jll^^^ f. ■= fJ'x,^,k -^[fJ'X,^,k\)- 

(iv) {Law of large numbers, limits are distribution free) Our approach 
(see also [31]) also yields a weak law of large numbers, namely 

u^^^,) ^ Im,^, {g, f, h) yh G e(^), </. G F. 

By taking /i = 1, we obtain a weak law of large numbers for the fc-nearest 
neighbors spacing statistic N^^^. Combining this with Theorem 2.1 and 

taking g = f, we see from (2.5) that the limiting mean of n~'^{h,v^^^^j}i 
and the limiting variance and distribution of n~^^'^{h,V^ ^ ^) do not depend 
on / for h = 1 (and, in fact, for any h). Therefore, the nearest neighbor 
functionals are asymptotically distribution free under the null hypothesis g = 
f and have asymptotic variance y^^fc(l) — (A(^^fc(l))^. A possible goodness- 
of-fit test would be to take the density g to be tested, compute the functional 
1 see whether the cumulative distribution function is close to the 
-^(0, V^^i(l) — (A0^i(l))^) cumulative distribution function. 

(v) ( Voronoi cells) Volumes of nearest neighbor cells are computationally 
attractive and have correlations decaying exponentially with the distance 
between cell centers. Defining point measures analogous to (2.3) based on 
cells generated by any locally defined Euclidean graph (e.g., Voronoi cells) 
leads to similar CLTs, adding to the laws of large numbers given in [24]. 

(vi) {Properties of limiting variance) In most of our examples, A^^^ is 
strictly positive, showing that Poissonization leads to a larger limiting vari- 
ance. When Vfp^k is convex, which is the case when k = 1, 4>{x) = x'^ ,r E [1, oo) 
or when 0(a;) = a;logx (see Section 3.1), then inequality (2.6) implies that 
the limiting variance over Poisson samples is minimized when g = f. 

2.4. Asymptotic normality of sum functions of spacings. In dimension 
d=l, if g is a probability density with distribution function G on [ci,C2], 
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then the generalization to A;-spacings of the empirical (^-divergence defined 
at (1.1) is the classical /c-spacing statistic defined by 

n—k 

(2.18) Sl^^^ := ^ </.(n[G(X(,+fc)) - 

i=l 

Developing the limit theory for Sf^^j^ over continuous samples is important 
in goodness-of-fit tests. We can apply the general theory of Section 2.3 by 
putting d = l and /C = (0, oo). Then the width of C^{x,X) is the distance 
between x and its fcth nearest neighbor in "to the right." Thus, the k- 
nearest neighbors spacing statistic N^^^,, defined by (2.3), is the same as 
^n4>k with the sum in (2.18) extended to n terms and with := C2 
if j > n. 

To better match the existing literature, we consider a modified version of 
Theorem 2.1 in which we redefine C^{x,X) to be the empty set whenever 
card(^ n (x + /C) \ x) < fc, and set (p{0) = 0. Denote by i^n<i>k ^he analog of 

fc under this modification (here we suppress the dependence on g) , that 

is, 

n—k 

(2.19) ul^^, := ^ </.(n[G(X(,+,)) - G{X^^)])Sx,. 

i=l 

The corresponding centered measure is then denoted v^^^- If d=l and 
K, = (0, oo), the total measure of i/* ^ ^ is indeed equal to ^ 

Theorem 2.2 (Gaussian limit for sum functions of spacings). Let A := 
[ci,C2], /C = (0,oo) and (p £ J^. Then the conclusion of Theorem 2.1 holds 
with replaced by ^ncfyk- Moreover, in this case, V(j)^k{l3) is given by 
(2.13). ' ' 

The proof of Theorem 2.2 is a straightforward modification of the proof 
of Theorem 2.1; see Theorem 2.2 of [4] for details. 

Applications of Theorem 2.2 are given in Section 3. This result, like our 
main result, shows that sum functions of spacings are asymptotically distri- 
bution free under the null hypothesis f = g. 

Remarks, (i) Darling [13] undertook the first systematic study of the 
functionals Sn,(i>,k when k = 1, but restricted attention to uniform samples. 
Theorem 2.2 generalizes Hoist [21], as well as earlier work of Cressie [8], who 
proves asymptotic normality (but not convergence of f * ^ ^ against bounded 
test functions) for sum functions of fc-spacings over uniform points. Hoist 
uses a generalization of LeCam's method and a CLT for /s-dependent ran- 
dom variables. In d=l, Hoist and Rao [22] prove asymptotic normality of 
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^n<i>k under "somewhat stringent conditions" on / and g. Mirakhmedov [28] 
considers the error term in the CLT for the functionals Sn,4,,k when F and G 
are the uniform distribution on [0,1]. For nonuniform samples, the asymp- 
totics of 5^ ^ ^ have been widely studied under the assumption that G runs 
through a sequence of alternatives G„ approaching the uniform distribution; 
see Hall [20], Kuo and Rao [27] and del Pino [34]. Khashimov [26] estab- 
lishes asymptotic normality oi S^^^^ under rather technical differentiability 
conditions on (j) and /. 

(ii) The approach used here also yields a weak law of large numbers, 
namely convergence in mean-square of n~'^S^ <f,k^^ ^M4,i9^ /)• This extends 
the corresponding weak laws in [25]; see also [17]. Analogous results hold for 
nonover lapping A;-spacings [39]. 

2.5. Divergences based on cells of fixed radius. Instead of considering 
point measures based on spacings, we now consider using cells of fixed radius 
depending on a continuous g: M+ and a parameter t. Thus, given cp £ 
and t > 0, we define 

H'ny-=-2 E </'(card{A'„ni?,(„^(,))_./4^)}-l)- 

When (j){x) = x and g = 1, then i?^'^ counts the total number of pairs of 

points in distant at most n~^^'^t from each other. 

The following CLT is obtained by modifying the proof of Theorem 2.1; 
we refer to Theorem 2.3 of [4] for details. 

Theorem 2.3 ([4]). (Gaussian limit for the number of pairs of points 
within distance t ). For all continuous g:A^ M~^,t > 0, and (p £ J-', there is 
a constant erf ^ g{f) such that as n^oo we have n~^Yai[H^f^] -^crf^g{f) 
and 

Remarks. The limiting variance <7^0g(/) takes the form of the right- 
hand side of (2.10) with h=l and with the functions V^^k and 5^,k suitably 
modified; see [4] for details. 

Various authors have studied H^'^^ when (^(x) = x and g = 1; see Chapter 
3 of [30] and references therein. Jammalamadaka and Zhou [23] and also 
L'Ecuyer et al. [16] consider H^'^ from the point of view of goodness-of-fit 
tests, (the latter reference only for uniform samples). Penrose [30] (Chapter 
4) proves that the finite-dimensional distributions of the process H^'^^,t > 0, 
converge to those of a Gaussian process. 
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3. Applications. 

3.1. Classical spacing statistics. For many tests involving goodness-of- 
fit (Dudewicz et al. [15], Blumenthal [7], Cressie [8], Hoist and Rao [22], 
del Pino [34], Weiss [42]) and parametric estimation (Ghosh and Jammala- 
madaka [19]), it is important to know the asymptotic distribution of ^ ^ 
[defined at (2.18)] for arbitrary g and / and for various choices of 4>. The 
following provides some illustrative examples. For simplicity of exposition, 
we have chosen to state our central limit theorems for the statistic S^^j^; 
the results for associated random point measures (2.19) are given in [4]. 
Throughout Section 3.1, we write V^j^ for the value of V^^^ given by (2.13). 

3.1.1. Limit theory for logarithms of spacing s. Let 

n—k 

Siio,,k ■■= E log(n[G(X(,+,)) - G(X(,))]) 
1=1 

denote the sum of the logarithmic /c-spacings. Setting (j){x) = logx in Theo- 
rem 2.2 and appealing to (2.8) and (2.13), we find a CLT for logarithms of 
A:-spacings as follows. 

Let tp be the di-gamma function with ^l^{k) := X]i=/^~^ ~ 7) where 7 is 
Euler's constant, and let tp'ik) := - Ei=i^ ^"^ + 7r^/6. 

By Cressie [8] and Hoist [21], 
fc-i 

J2 Cov(iogrfc,iog(rfc+,- - r,)) = kik - l)^p'ik) -{k-i). 

Also, E[logrfc] = ij{k), so we have 2A;E[logrfc](Elogrfc-Elogrfc+i) = -2ij{k). 
Also, E[log^rfc] =Tp'{k) + {'4){k))'^. So, combining terms and using (2.13) for 
4'{x) = logx gives 

(3.1) vi^^,{i) = ij'ik) + {m? - mk) + 2[k{k - i)^p'{k) -{k- 1)]. 

By (2.8), we have Aiog,fc(l) = {k + l)il:{k) - ki;{k + 1) = ^Pik) - 1. 

Using simple relations such as Cov(log/3X, log/?y) = Cov(logX, logY), it 
is straightforward to deduce that 14og,fc(/^) = ^ifg,fc(l) + log^ /5+ 2 log I3{ip{k) - 
1) and Aiog,fc(/3) = Aiog,fc(l) + log/3. Substituting this into Theorem 2.2, 
putting Tfc := (2/c^ — 2k + l)ip'{k) — 2A: + 1, and rearranging terms yields the 
following corollary. 

Corollary 3.1 (CLT for logarithmic fc-spacings) . Let X,Xi,X2, ■ ■ ■ he 

i.i.d. with density f on [0, 1] . As n ^ co, Var[5^ ^] rfc+Var[log(^^^)] 
and 

'^"'/'(^liog,. - IES^,iog,.) ^ A^(o,r, + Var 
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Remarks. When A = [0, 1] and f = g = l, the CLT for 5^ ^ was es- 
tablished by Darhng (Section 7 of [13]) for A; = 1 and later by Hoist [21] and 
Cressie [8] for general k. When the Xi have a step density, Cressie shows 
asymptotic normality of j. including cases when A; — > oo . Czekala (The- 
orem 1 of [12]) apparently rediscovered Cressie's result. Shao and Hahn [40] 
treat general densities for k= 1, although their proof depends upon inter- 
changing limits in order to pass from step densities to arbitrary densities. 
When A: = 1, Blumenthal (Theorem 2 of [7]), proves Corollary 3.1 for densi- 
ties / satisfying special conditions. Corollary 3.1 extends all of these results 
to / and g bounded away from zero and infinity, resolving a conjecture of 
Darling ([13], page 249) affirmatively. 

3.1.2. Information gain of order r. Let (t){x) = x'^,r > 0. We write S^^.^ 
to denote S^^^, also known as Renyi's information gain (I-divergence) of 
order r in d = 1, that is, 

n-l 

Slr,i :=E(^TOm))-G'(X(,))])^ 

i=l 

Let Wr := -2rr2(r + l)+r(2r + l) and U := r(r + 1)(1 - r). It is a simple 
matter to verify via (2.13) and (2.8), respectively, that for all /? > 0, 

Vf^^iP) := Wrfi^' and A^,i(/?) := 2E[,/.(/3ri)] - E[0(/3r2)] = trl5\ 

Put 

Theorem 2.2 yields the following corollary. 

Corollary 3.2 (Gaussian limits for information gain). Let Xi,X2, ■ ■ ■ 
be i.i.d. with density f on A:= [ci,C2]. As oo, we have for all h G 13(A) 

n"^Var[S'^ ,, i] ^cr^(/,5) 
and n~y^{Sl,^,-ESi^,^,) ^ N{0,aUf,g)). 

Remarks. It is easy to verify using [5] that cr^(/, g) > except when r = 
1. Corollary 3.2 extends upon the CLTs of Darling [13] (uniform case) and 
Weiss [42] . Moran [29] proved a CLT for the functional ^ ^ over uniform 
random variables. 
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3.1.3. Limit theory for log-likelihood ratio. Let (pix) = xlogx. Consider 
the log-likelihood point measure 

1=1 

and let denote the total mass of this measure, also called the log-likelihood 
statistic. Again, denoting Euler's constant by 7, we have for /3 > that 

E[PTi log(/3ri)] = /31og/3 + /3(1 - 7); 
E[PT2 log(/3r2)] = 2/31og/3 + (3(3 - 2j); 
E[(/3ri log(/3ri))2] = (3^[2{log f3f + (6 - 47) log/3 + 2 + - 67 + 27^]. 

Using these in (2.13) and (2.8), respectively, it is easily verified that 

V^iW) := - 2^/32 and A^,i(/3) := 2E</.(/?ri) - E</.(/?r2) = -/?. 
Put 

.J(/,.):^(^-2)/^ff..-(/^.(.)..)^. 

Let X have density / and note that since g is a density we have 

.5(/..)^(^2)va.[f|f^]4-3. 

Using the above values for ^cpA, o'^(/,5), and applying Theorem 2.2 
for 4>{x) =xlogj; yields the following corollary. 

Corollary 3.3 (Gaussian limit for log-likelihood). Let Xi,X2,... be 
i.i.d. with density f on A := [01,02]- As n — > 00, n~^Vav[S^^-^]^a'^{f,g) 
and 

Remarks. Corollary 3.3 extends the results of Gebert and Kale [18], 
who assume uniformity of Xi and Czekala (Theorem 2 of [12]), who assumes 
that Xi have a step density, van Es [41] establishes asymptotic normality 
for S*? whenever /c, n — > 00, A: = o(n^/2), and /:^^ [0, 00) is Lipschitz. 



3.2. Information gain and log-likelihood in high dimensions. In this sec- 
tion, we put k = 1 and JC = R*^. 
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3.2.1. Information gain of order r. Let = x"^ ,r £ M"*", so that N^^ ^ ^ 
defined by (2.3) yields Renyi's information gain (I-divergence) of order r. 
For all r £ M"*" , define the constant 

Kr:=r^ / s''~H'-^e-^'+^^[Jd{s,t)-msix{s,t)]dsdt, 
Jo Jo 

with Jd{s,t) given by (2.14). Since (p satisfies the conditions of Proposition 
2.2 and since E[(/>^(ri)] = r(2r + 1), the following is immediate. 

Lemma 3.1. For all /3 >0 and for (f){x) = x*", r > 0, we have that V^,i(/3) = 
/32^[r(2r + l) + if,]. 

Note that 0^''' = 4)^ {(3). Combining Lemma 3.1 with Theorem 2.1 yields 
the following CLT for N^^^^^. 

Corollary 3.4. Let (t){x) = x^,r > 0. Then as oo n~^Yav[N^^-^] 
converges to [r{2r + 1) + Kr]I^2^i{g, f ) - (/a^^ (/, g))^, and 



V 



iV(0, [r(2r + 1) + i^.] V,i(5, /) - (/a,,, (5, /))')• 



3.2.2. Log-likelihood. When (?!)(x) =xlogx, A^^^ defined by (2.3) yields 
the log-likelihood statistic. To apply Theorem 2.1, we define the constants 

POO POO 

Ii:= / (log s + 1) (log t + l)e~^''+*^ [Jd(s, t) -max(s,t)](is (it, 
Jo JO 

POO POO 

/2 := / / {logs + l)e~'^''+^>[Jd{s,t) -max{s,t)]dsdt 
Jo Jo 



and 

poo POO 



POO POO 

/3 := / / e"(^+*) [Jd{s, t) - max(s, t)] ds dt. 
Jo Jo 

Also, set i^i := 2 + f - 67 + 27^ + /i, K2 := 6 - 47 + 2/2 and A'3 := 2 + Ig. 
The following is an easy consequence of Proposition 2.2. 

Lemma 3.2. For ^{x) = xlogx, V^^i{(3) = (5'^{Ki+K2\ogfi+K^{\ogfif), 
P>0. 

Theorem 2.1 yields a CLT for the log- likelihood functional N^.-^. Put 



f{x)dx. 
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Corollary 3.5. Let (j){x) = x log x. Then as oo, n ^Var[iV^^J^ 
(^l{f,9) - (lA^JgJ)? and 

4. Proof of Theorem 2.1. The proof of Theorem 2.1 involves expressing 
the Poissonized measure /J-x^j^ [see (2.16)] as a sum of weakly spatially 
dependent terms, ahowing us to establish convergence of the variance of the 
measure /^^ ^ ^ (Proposition 4.1). Although the measures in question share 
neither the same representation nor the same scaling properties as those 
considered in previous work [5, 31, 32, 33], once we have shown the crucial 
variance convergence for measures defined in terms of Poisson samples, we 
can draw upon some well-established dependency graph techniques [31, 32, 
33] to deduce a Poissonized version of Theorem 2.1. Using arguments in [32], 
we may de-Poissonize and deduce Theorem 2.1 when (p is bounded on (0, 1]. 
Deducing Theorem 2.1 for general (p requires extra technical effort. 

Recall that for all a > 0, TCa is a homogeneous Poisson point process of 
intensity a on M*^. Suppose we fix the set ^4 C M*^, the densities / and g and 
their corresponding distributions F and G on W^, as described in Section 
2.1. 

For all A > 0,x G M'^, and all finite X C M"^, we lighten the notation and 
write C{x,X) for C^{x,X) given by (2.2). For ah Borel B C M*^, define the 
numbers ^x{x,X) = ^{{x,X) and Cx{x,X,B) := (^{{x , X , B) by 

^x{x,X):=^{\G{C{x,X)y, Ux,'V,B):=<^x{x,X)6,{B). 

Recalling that Xn := {^i, . . . , Xn}, we have 

n 

^lUB)=T.^'niX,^Xn,B). 
i=l 

The (signed) point measure £,xix, X, •) is determined by x and X, in a similar 
manner to the measures considered in [32], but here, unlike in [32], the 
measure ^xix,X,-) is not obtained by scaling the measure (because the 
function g enters in a more complicated way into the definition of ^x here) 
so we cannot directly apply results from [32]. We write {h,(,x{x,X)) for 
lR'ih{y)^x{x,X,dy). 

For locally finite X C^'^ and x £ A, we define 

(4.1) C'ofi^) :=e^'''(^) ■■=mx)\C{0,X)\). 

For all X G A and given G N, let to{x) denote the infimum of all t with 
the property that B^{x) H j4 is the same for all u>t. Given also a locally 
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finite set X C A and /C, and writing #(•) for card(-) and X \ x iov X \ {x} 
here, define 

Ri{x,X) 

' inf{t E M+ : #{Bf{x) r\X\x)>k}, if + IC)nX\x)>k, 



to{x), otherwise. 

Thus, Ri{x,X) is the distance between x and its kth nearest neighbor in 
X in the direction of the cone /C or if no such neighbor exists, the furthest 
one has to look from x to ascertain that this is the case. For A > 0, let 
Rx{x,X) := \^/'^Ri{x,X). The following lemma establishes the equivalent 
of the "exponential stabilization" conditions discussed in [32]. For a proof, 
see [4]. 

Lemma 4.1 ([4], Lemma 4.3). It is the case that 

(4.2) limsup sup t^'^ log P[Rxix,Vx) > t] <0 

t^oo xeA,X>l 

and 

(4.3) limsup sup t~^logP[Rx{x,XnU A) > t] <0. 

t^oo xeA,\>l,{X/2)<n<{3X/2),AeS3 

Recall (2.16) that fJ^Xcpk denotes the Poissonized version of Vx<f)k- "^^^ 
following is a Poissonized version of Theorem 2.1 for (p G J-q and is of inde- 
pendent interest. 

Proposition 4.1. Let (pe JTq and h e B{A). Then as X^oo 

(4.4) Var [(/i, f.^^^,)] ^ h\x)V^,u f{x) dx = ly,,, (<7, /, h') 

and X~^^^Ji^^ ^ converges in law as X ^ oo to a mean zero Gaussian field 
with covariance kernel {hi,h2) ly^ ^{g, f,hi)Iv^ ^.{g, f,h2)- 

Proof. For simplicity, we first assume that h is a.e. continuous. It is 
the case that 

A-iVarK/.,4^_,)] 

= xf f h{x)h{y){E[^x{x,VxUy)^x{y,VxUx)] 

(4.5) ^^^^ 

-E['^x{x,rx)]mx{y,rx)]}f{x)f{y)dxdy 



+ / h^{x)E[<!>l{x,Vx)]f{x)dx. 
J A 
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We will sketch an argument (see Proposition 4.1 of [4] for details) showing 
that Var[(/i, /i^ , ^.)] converges to 



(4.6) 



/ / /i2(x)[Ee"(W/(,) U z)e"(-^ + (W/(.) U 0)) 

-{EC3^-{nf^,))f]f\x)dzdx 



+ 



Putting y = X + X ^^^z in the right-hand side in (4.5) reduces the double 
integral to 



(4.7) 
where 



h{x)h{x + X~^/'^z){- ■ ■} fix) fix + X~^/'^z) dzdx 



{• • •} := {mxix, Pa U {x + A-i/'^4)$a(x + A'^Z-^z, Vx U x)] 

- E[$a(x,Pa)]E[$a(x + X-^/''z,Vx)]}. 

By using Lemma 4.1, we may show (see [4] for details) that {• • •} converges to 
the bracketed expression in the first term of (4.6), and that the integrand in 
(4.7) is dominated by an integrable function of z over W^. The convergence 
of the double integral in (4.5) to that in (4.6) now follows by dominated 
convergence, the continuity of h and fourth moment bounds on <I>a- To show 
convergence of general h S BiA), we refer to [32]. 

Similar but easier methods show convergence of h?ix)E,[<^\ix,'P\)\fix) dx, 
completing the proof that (4.5) converges to (4.6). 

For all X & A, we define V^j^ix, 0) := and for all a > 0, we put 

vlJx,a):=m'J'i'Haf] 



+ a / [Ee^^XWa U Z)e^(-Z + iHa U 0)) - (Ee"(Wa))'] dz. 



Using (4.1), it is easy to see that 



F« (x,a)=E 



gix). 



+ 



E 



aix) 



(4.8) 



X (p 



a 

dix) 



\cio,nuy)\ 
\ciy,nuo 



E 



gix) 



dy 
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and in particular, by definition of V^^fc [recall (2.9)], we have 

y|,(x,/(x)) = y,,,(^ 

By combining this with (4.6), we thus obtain the desired limiting variance 
(4.4). 

The proof of the second part of Proposition 4.1 (i.e., convergence to the 
normal) follows from arguments similar to those used in Theorem 2.2 of [32], 
which itself follows dependency graph arguments in [33]. Here, we note that 
^^iT~if[x)) corresponds in our setting to the limiting expression from Lemma 
3.4 of [32], and consequently appears in expressions for limiting variances 
arising from following the proofs in [32] , where all expressions for limits are 
obtained through Lemmas 3.4 and 3.5 of [32]. □ 

To obtain Theorem 2.1, we shall de-Poissonize Proposition 4.1 by suitably 
adapting the proofs of Theorems 2.1, 2.2 and 2.3 of [32], and then extend 
the result to all G by using truncation arguments. 

Proposition 4.2. Suppose (j) ^ Tq. Then the conclusions of Theorem 
2.1 hold. 

Proof. Taking Proposition 4.1 as our starting point, we can follow 
nearly verbatim the de-Poissonization argument of Section 5 of [32] which is 
used there to prove Theorem 2.3 of [32]. See Proposition 4.2 of [4] for details. 

To obtain the limiting variance in the present setting, it suffices to consider 
the corresponding limits obtained in Lemmas 3.4 and 3.5 of [32]. That is, 
analogously to the definition of 5(x,a) in [32], we define for all x G vl and 
all a > 

A«,(x,a): = E[er(Wa)]+a / [Eel^-(W, U y) - ^^-(7^,)] 



9{x) 
1 k 



+ a / ¥.[(t>{g{x)\C{Q,na^y)\) - 4>{g{x)\C{0,na)\)]dy. 

Changing variables y ^ a^^'^y, using (2.7) and the equivalence a^/'^TLa = 'H, 
yields 



(4.9) =M,,{^) 



E 



i^\c{o,nuy)\] - cf>(i^\c{o,n)\ 



dy. 
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We now show that A|^(x,a) reduces to A,^ fc(g((x)/a) defined by (2.8). Put 
f5:=g{x)/a and hd := \Bf{0)\. Since |C(0,W)| =rfc, (4.9) yields 

4,fc(^>«)-^M(/3) 

E[(0(/5|C(O, W U y)\)- mC{^,'H)\))lMy\'' < ^k}] dy 



(4.10) 

' E[(0(/3max(6rf|2/|^rfc„i))-(/.(/3rfc))l{6rf|y|'^<rfe}]dy 



K. 

= E r"'' (l)if3Tk-i)ds + E 0(/3s)ds-E[rfc</.(/5rfc)], 

where we put s := |i?^|(0)|. The third term in the right-hand side of (4.10) 
is 

(4.11) E[Tk(t>{Prk)] = s(t,{f3s)j^—^e-'ds = kE[<p{PTk+i)] 

and likewise, the first term is {k — l)E^(/?rfc). Recalling that = J2i=i 
and setting t = s — r^-i, we find that the middle term in the right-hand side 
of (4.10) is 

Ej^ mrk-i + t))l{t^r,,,}dt = Ej^ <P{l3{rk-i + t))e-'dt 

= E[(j){prk))] = M^,k{P). 

Combining these expressions for terms in the right-hand side of (4.10) yields 

Al^{x,a) = {k + l)M<^,fc(/3) - A;M<^,fc+i(/3) := A<^,fc(/3). 
The result (2.12) then follows from the proof of Theorem 2.3 of [32]. □ 

We now extend Theorem 2.1 to cases with (pG J^\J^q (i.e., where (p "blows 
up" at 0) via a truncation argument. Given e > 0, define the functions 
0=:M+^M and (/)£:M+^Rby 

(j){x), if X > e, 
0, otherwise. 



cpeix) :-- 



4>{x), if a; < e, 
0, otherwise. 



To prove Theorem 2.1 for (p^T when either /C = M'^ or d = 1, we will use 
the following lemma, whose proof is given in [4] . 

Lemma 4.2 ([4], Lemma 5.1). Given h G BiW'-) and 5 > 0, there exists 
£0 > and no > such that for e G (0, Eq) and n>no we have n"^ Var [(/i, ^ ^ 
6. 
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Before stating the next lemma, we define for all /3 > 0, ?/ G M'^, G and 
e>0 

(4.12) ^P{P, y) := E[mC{0, Ti U y)|)</.(/3|C(y, W U 0)|)] - (E</.(/3rfc))2 
and 

(4.13) MP,y):=E[cP%P\CiO,nUy)\Wi(3\Ciy,nuO)\)]-{EcP%(3r,)f. 

We also define ax := E[(l)* {T ^ / Kf] + E[(j)*{Krkf] for K > 0, and ob- 
serve for any K > 1 that qk < oo. Also, if fi < then since is 
decreasing on (0, 1) and increasing on (1,00), 

E[ct^*{fiTkf]=E[ctr{PTkfl{Tk < l/(3}]+E[<j)*ipTk)H{rk > 
<E[(P*{Tk/Kfl{Tk<l/P}] 

(4.14) 

+ E[(P*{KTkfl{Tk>l/P}] 

< UK- 

The proof of the next lemma is technical and is given in [4]. 

Lemma 4.3. ([4], Lemma 5.2). Let K > 1. Then there exists a Lebesgue 
integrable function :R'^ — > [0, 00), such that 

(4.15) \A{P,y)\<rKiy) VyGM'^\{0},eG(0,l],/3G[l/K,K]. 

Our next two lemmas, proved in detail in [4], show that V^e,fc(/3) and 
A(j)E^kiP) defined by (2.9) and (2.8), respectively, converge to V0^fc(/3) and 
A^'kiP) as e i 0. 

Lemma 4.4 ([4], Lemma 5.3). For all f] > and k G'N, V^p^kiP) satisfies 

(4.16) limy^.,fc(/3) = y^,,.(/3). 

Moreover, given K £ [l,oo), it is the case that 

(4.17) sup{|y^e,fc(/3)| : < e < 1, 1/K < /3 < i^} < cx). 

Lemma 4.5. For any (3 > and /c G N, we have 

(4.18) limA^.,fc(/3) = A^,fc(/3). 

ej.0 

Also, given KG [l,oo), it is the case that 

(4.19) sup{|A^e,fc(/3)| ■.0<e<l,l/K <(3<K}<^. 
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Proof. Let 0* be the dominating function given by (2.1). If /? > 0, then 
it is straightforward (see (5.24) of [4]) to see that (^*(/3rfc)^ is a nonnegative 
integrable random variable which dominates (ff{(3Vkf', so by the dominated 
convergence theorem, as e | we have 

(4.20) E[0^(/?rfc)]^E[</.(/3rfc)]. 

By (2.8) and (4.20), we obtain (4.18). Also, (2.8) implies |A0.,fe(/3)| <{k + 
l)W.[4)*{f5Vk)\ + k'E[<i)*{l3Tk+i)] and the bound (4.19) easily follows from this 
with (4.14). □ 



Given h G B{A), let Lh{4>) be the limiting variance in the statement of 
Theorem 2.1, that is, let 



(4.21) 



Lemma 4.6. Given h G B{A), it is the case that 
(4.22) limL^(<^^) = L;,(<^). 



Proof. By assumption, {g{x)/ f{x),x G A) is bounded away from and 
oo, and / is bounded. Hence by (4.17), the integrand in the first integral in 
the expression (4.21) for Lh{4)^) is bounded by a constant, not depending on 
e. Similarly, by (4.19), the integrand in the second integral in the expression 
(4.21) for L/i((^^) is bounded by a constant, not depending on e. By (4.16) 
and (4.18), for both integrals the integrand converges, as e J, 0, to the corre- 
sponding integrand for L/j ((/>). So, by the dominated convergence theorem, 
the integrals converge and (4.22) follows. □ 

Proof of Theorem 2.1. Let /i G B{A). Given 5 > 0, by Lemmas 4.2 
and 4.6, we can find eo > and no > such that for e < eo and n > no, we 
have 



(4.23) \Lhm-LHm<5 

and n~^ Var[(/i, f,^ ^)] < 5. The function (ff lies in the class J-q, so by 
Proposition 4.2, 

(4.24) Jim n-iVar[(/.,z.^^^. ,)] =L,(#) 
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and hence by the Cauchy-Schwarz inequahty, for large enough n, we have 

n"i|Var((/i,<^^,))-Var((/.,<^.^,))| 

= \n-' Var((/., u^^J) + 2 Co^^n-'/^h, u^^.^,) , n~^l'{h, ul^J)\ 

< S + 25^'\n~^ Var(/i, ^1^.^^))^'^ < 5 + 25^l\Lh{^) + 5f'\ 

Using (4.23) and (4.24), for large enough n, we thus have 

Var[(/i, u^^^^)] - L,(</>)| < 35 + 25^/\Lh{ct>) + 

and since (5 > is arbitrary, this shows that 

n"^Var[(/i,i/^^^^^)] ^L,,(0) asn^oo, 

which is the first part of the statement of Theorem 2.1. 

To prove the stated asymptotic normality of n~^^'^ {h,T'^ ^ it suffices to 
show that for any h G B{A), 

(4.25) n~y'(^h,vl^^,)^N{0,L,m. 

Let t £ M. Set X„ := n-^/^{h,vf^^^j^) and for e > set := n-^l^{h,vl^^, ^^) . 

Since (j)^ is in Proposition 4.2 shows that — > -^(0, Lh{(f>'^)) as n — > oo. 
Hence, 

(4.26) E[exp(itX^)] -exp(-t2L/,((^=)/2) ^0 asn^oo. 

Given 6 > 0, by Lemmas 4.2 and 4.6, we can choose e > such that for large 
n, 

E[| expiitXn) - eMitX'Jl] < E[|t(X„, - X^)|] < S 

and also |e~*^-^h("?^)/2 - e-*^^h('?^'')/2| < J so that combining with (4.26), we 
have for large n that 

\E[exp{itXn)] - e-*'^'^W/2| < 35 
and since 5 is arbitrary, this implies (4.25). □ 

5. Proofs of Propositions 2.1 and 2.2. 

5.L Proof of Proposition 2.1. First, we identify V0^i(/9) when IC^W^, 
which implies —y ^ fC for all y £ IC. The integral in (2.9) has contributions 
only from y G /C and from G (y + /C) , and these contributions are equal by 
a symmetry argument. Let bd := |i?f'(0)|. 
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Consider y & )C. Then \Ci{0,7iLly)\ has the distribution of min(ri, 
and \Ci{y,TC U 0)\ has the distribution of Fi, and they are independent. 
Hence, the integral in (2.9) is equal to 

2 / E[^{PT,)m^{Pmm{Ti,bd\y\'')) - HP^i)])dy 
JK 

roo 

= 2M^,i(/3) / E[(t}{(3mm{Ti,s))-4>{(3Ti)]ds 
■Jo 

= 2M^,i(/3)E r\<Pi(3s)-<Pi(3T,))ds. 
Jo 

In the last expectation, the first term is equal to (p{l3s)P\ri > s] which 
comes to M^^i(/3). The second term comes to M^^2{fi) as in (4.11). Thus, the 
integral in (2.9) is equal to to 2M(^^i(/3)(M^^i(/?) — M^ 2{P)) and substituting 
in (2.9) we find that K/)^i(/3) is given by case A; = 1 of formula (2.13) when 

To complete the proof of Proposition 2.1, we need to show that V^^fc is 
given by (2.13) in the case when d=l (for arbitrary k, but still assuming 
/C 7^ M"'). There are only two possibilities for JC and by symmetry it suffices to 
consider the case with /C = (0,oo). In this case, the expression (2.9) becomes 

/oo 
c^(0,y)dy 
-oo 

where 

c^(0,y) :=E[</.(/3C7o)0(/?C,)] - (E[<A(/3^fc)])^ 

where Co (resp., Cy) denotes the length of the /c-spacing starting at the ori- 
gin (resp., starting at y) with respect to the augmented point set W U U y. 
We proceed to evaluate the integral in (5.1). Write efc := E[(/)(/3rfc)]. Then 

c^(0, y) = E[{cl){f3Comf3Cy) - (l){(3Tk)ek){l{y < T^} + l{y > rj)] 

= E[{ct){pCo)Hf3Cy) - 4>{PTk)ek)l{y < T,.}]. 

Integrating over y and setting Tq := 0, we have that 

POO I ^ \ roo 

(5.2) c^{^,y)dy=\^ljy E[</.(/3rfc)ea{y < r,.}] dy, 

where we set 

/■oo 

:= E / WC^)^{l3Cy) ■ \{y < Tj}) dy. 

Recall that Tj = X]i=i We now compute Ij in the case with 1 < j < 
k — 1. For such j, if Fj_i < y <Tj then Cq = Tk-i and Cy = Tj+k-i — V, 
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setting w = y — Tj-i, we have for 1 < j < A; — 1 that 

Ij=eI%(p(tj_i + w + {Tij-w)+ Ti^\ 



i=j+i 



|^rij-'u;+^ Ti^ijjl{Tij>w}dw. 



Now take the expectation inside the integral. Since Tij is exponential, we 
have P\^i,j >w]= e~'^ , and by conditioning on this event, using the mem- 
oryless property of the exponential distribution and independence of Fi^- 
from the other random variables in the expression, we obtain 

/ / fc-i \ \ 



i=j+i / 
1 e""' dw. 



I I j+k-l 
\ \ i=j+l / / 

Now take the integral back inside the expectation. Letting Fi^o be a further 
independent exponential random variable with density function e~'^,w > 0, 
we have that 

ij = K 4> (^p (r,^i + ri,o + Tij + ^ Ti,,^ ^ 



(5.3) 



/ fk-l \ 
V \i=0 / 



/ /fc+j-1 ^ 



= E[(/)(/?rfc)0(/3(rfc+,-r,))]. 

To deal with 7^, we modify the preceding argument as follows. If F^^i <y < 
Tk, then Co = y and Cy = r2fc+i — y- Setting w = y — Ffc^i, we have that 

4 = HPi^k-i + w))^ l^p \r,^k - + E j j H^i,k > w} dw. 

Conditioning on the event that Fi ^ > w using the memoryless property of 
the exponential distribution and independence of Fi ^ from the other random 
variables in the expression, we obtain 

2k-l 



4 = E^"0(/3(Ffe_i + u;))0(/3(Fi,fc+ |] Fi,, 



' dw. 



GAUSSIAN LIMITS FOR GENERALIZED SPACINGS 



25 



Letting Fi^o be a further independent exponential random variable, we have 
that 



4 = E 



2k-l 



i=k+l 



f /k-1 N 

^ \i=o y 



/ /2k-l \ 

\ \i=k J 



4- 



Now as in (4.11) the last term in (5.2) is 

roc 

ek / E[(^(/3rfc)l{y < TJ] dy = A;efcE[0(/3rfc+i)]. 
Jo 

Combining this with the preceding expressions for < k) and for 1^, we 
may rewrite (5.2) as 

roo /k—l \ 

c^(0, y) dy = \^J2 E[(^(/3rfc)0(/3rfc+j - /3r,)] j + el - kckCk+i 

= (^YimiP^kHiPTk+j - PTj)] - el)^ + kekiek - e^+i) 

k~l 

= kek{ek - Cfc+i) + Y Gov {(j){(3Tk),(l)i(3Tk+j - (3Tj)). 
i=i 

By symmetry, for all (3, we have J^^c^{0,y) dy = c^{0,y) dy, and thus 
from (5.1) we obtain for all (3 > that V^^kiP) is given by (2.13). This 
completes the proof of Proposition 2.1. 



5.2. Proof of Proposition 2.2. We deduce Proposition 2.2 as follows. 
From the definition (2.9), we obtain 



y^,i(/3) = iVV,i(/3)+ / c(0,y)(iy, 

where c(0,y) := E[,/.(/3|Ci(0, W U y)|)0(/?|Ci(y, W U 0)|)] - m>{(3V{)\f. For 
all s,t eM+, let p(s,t) := P[|Ci(0,H U y)| > s, |Ci(?/, H U 0)| >t]. Then for 
all s,t G [0, lyl^'wrf], we have 



Otherwise, p{s,t) = 0. Hence, for y G M'^, by the fundamental theorem of 
calculus, the assumption that (f) is differentiable with limf|o </'(i) = 0, and 
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Fubini's theorem, 



c{0,y)=Ej^ /3V(/3s)(/.'(/3t)l{|Ci{o,wuy)| 

/ roc, 

E / (/?u)l{ri>.4(in 



>s,\Ci{y,H\JO)\>t} ds dt 



oo roo 



= / (j)'{(3s)(l)'{(3t) [p{s, t) - e-("+*)] ds dt. 

Jo JO 

Since p{s,t) vanishes whenever {s,t) ^ [0, |y|°'u;(i]^, we obtain 
Jo Jo 



0^ ct>'{l3s)cl)'{()t)e-^'+'Usdt. 

J J ma,x(s,t)>\y\'^uiii 



Therefore, 

c{0,y)dy 



JO 



/ / (l)'{(5s)(t)'{f3t)e-^'+^Usdtdy 

jRd J Jraax{s,t)>\y\'iu}a 

and letting u := \y\'^u>d, the above becomes 
c{0,y)dy 

/* OO PUf PUf 

= /3^/ / / ^'{/3s)cb'{pt)e-^'+'^[e^'^''''^''/'^''^'^'^ -l]dsdtdu 



JO JO 

oo 



// (p'{(3s)(t)'{pt)e-^'+^^ dsdtdu. 

Jo J Jmax{s,t)>u 

Finally, change the order of integration to obtain 
c{0,y)dy 

roo roo roo 

= /3M / (/)'(/3s)(/.'(/3t)e-(^+*) / {e^'^'^'^^^l^'^^'^ -\\dudsdt 

Jo Jo Jmax{s,i) 

/•oo /-oo /•max{s,t) 

-f\ \ (/)'(/3s)(/''(/3t)e-("+*W dudsdt, 
Jo Jo Jo 

which is exactly the desired limit. This proves Proposition 2.2. 
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